The TIC: Parsing Interesting Text
Author
Abstract
This paper gives an overview of the natural language problems addressed in the Traffic Information Collator/Condenser (TICC) project, and describes in some detail the "interesting-corner parser" used in the TICC's Natural Language Summariser. The TICC is designed to take free text input describing local traffic incidents, and automatically output local traffic information broadcasts for motorists in appropriate geographical areas. The "interesting-corner parser" uses both syntactic and semantic information, represented as features in a unification-based grammar, to guide its bi-directional search for significant phrasal groups.
Similar resources
Automatic Semantic Role Labeling in Persian Sentences Using Dependency Trees
Automatic identification of words with semantic roles (such as Agent, Patient, Source, etc.) in sentences, and attaching the correct semantic roles to them, may lead to improvements in many natural language processing tasks including information extraction, question answering, text summarization and machine translation. Semantic role labeling systems usually take advantage of syntactic parsing and th...
Evaluating a Statistical CCG Parser on Wikipedia
The vast majority of parser evaluation is conducted on the 1984 Wall Street Journal (WSJ). In-domain evaluation of this kind is important for system development, but gives little indication about how the parser will perform on many practical problems. Wikipedia is an interesting domain for parsing that has so far been underexplored. We present statistical parsing results that for the first time...
Effective Classification of Text
Text mining is the process of obtaining useful and interesting information from text. A huge amount of text data is available in various formats, and most of it is unstructured. Text mining usually involves structuring the input text (parsing it and inserting the results into a database), deriving patterns from the structured data, and finally evaluat...
Learning Efficient Parsing
A corpus-based technique is described to improve the efficiency of wide-coverage high-accuracy parsers. By keeping track of the derivation steps which lead to the best parse for a very large collection of sentences, the parser learns which parse steps can be filtered without significant loss in parsing accuracy, but with an important increase in parsing efficiency. An interesting characteristic...